AITopics | eiffel tower

Collaborating Authors

eiffel tower

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

6f1d43d5a82a37e89b0665b33bf3a182-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-19-2026, 05:46:27 GMT

liberty island, rome, sonic drift 2, (11 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > Scotland (0.05)
Europe > Albania > Tirana County > Tirana (0.04)
Europe > Germany > Hesse > Darmstadt Region > Frankfurt (0.04)
(17 more...)

Genre:

Research Report > New Finding (0.68)
Personal (0.46)

Industry:

Leisure & Entertainment > Sports (1.00)
Leisure & Entertainment > Games > Computer Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.73)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.51)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.51)

Add feedback

Langevin dynamics, the samples contain small Gaussian noise that

Neural Information Processing SystemsAug-22-2025, 00:26:51 GMT

We thank all the reviewers for providing valuable feedback in this time of stress. We will include these new results in the revision. NCSN (CIFAR-10) NCSNv2 (CIFAR-10) NCSN (CelebA) NCSNv2 (CelebA) FID 27.44 10.31 17.57 9.69 [R1] Is the model memorizing data (like the Eiffel towers in Figure 1)? NCSNv2 uses the new architecture and the others use the old one. We will incorporate your suggestions in the revision.

dataset, langevin dynamic, small gaussian noise, (15 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.36)

Technology: Information Technology > Artificial Intelligence (0.50)

Add feedback

Appendices A Solving for Algebraically

Neural Information Processing SystemsAug-15-2025, 16:55:56 GMT

Causal traces show that the last token of the subject name is not always decisive.

large language model, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > Scotland (0.05)
Europe > Albania > Tirana County > Tirana (0.04)
Europe > Germany > Hesse > Darmstadt Region > Frankfurt (0.04)
(17 more...)

Genre:

Research Report > New Finding (0.68)
Personal (0.46)

Industry:

Leisure & Entertainment > Sports (1.00)
Leisure & Entertainment > Games > Computer Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.73)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.51)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.51)

Add feedback

Knowledge-Aware Self-Correction in Language Models via Structured Memory Graphs

Saha, Swayamjit

arXiv.org Artificial IntelligenceJul-8-2025

Large Language Models (LLMs) are powerful yet prone to generating factual errors, commonly referred to as hallucinations. We present a lightweight, interpretable framework for knowledge-aware self-correction of LLM outputs using structured memory graphs based on RDF triples. Without retraining or fine-tuning, our method post-processes model outputs and corrects factual inconsistencies via external semantic memory. We demonstrate the approach using DistilGPT-2 and show promising results on simple factual prompts.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2507.04625

Country: North America > United States (0.29)

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

What Do Large Language Models Know? Tacit Knowledge as a Potential Causal-Explanatory Structure

Budding, Céline

arXiv.org Artificial IntelligenceApr-17-2025

It is sometimes assumed that Large Language Models (LLMs) know language, or for example that they know that Paris is the capital of France. But what -- if anything -- do LLMs actually know? In this paper, I argue that LLMs can acquire tacit knowledge as defined by Martin Davies (1990). Whereas Davies himself denies that neural networks can acquire tacit knowledge, I demonstrate that certain architectural features of LLMs satisfy the constraints of semantic description, syntactic structure, and causal systematicity. Thus, tacit knowledge may serve as a conceptual framework for describing, explaining, and intervening on LLMs and their behavior.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1017/psa.2025.19

2504.12187

Country:

North America > United States > Massachusetts (0.28)
Europe > United Kingdom > England > Oxfordshire (0.28)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Question-to-Question Retrieval for Hallucination-Free Knowledge Access: An Approach for Wikipedia and Wikidata Question Answering

Thottingal, Santhosh

arXiv.org Artificial IntelligenceFeb-7-2025

This paper introduces an approach to question answering over knowledge bases like Wikipedia and Wikidata by performing "question-to-question" matching and retrieval from a dense vector embedding store. Instead of embedding document content, we generate a comprehensive set of questions for each logical content unit using an instruction-tuned LLM. These questions are vector-embedded and stored, mapping to the corresponding content. Vector embedding of user queries are then matched against this question vector store. The highest similarity score leads to direct retrieval of the associated article content, eliminating the need for answer generation. Our method achieves high cosine similarity ( > 0.9 ) for relevant question pairs, enabling highly precise retrieval. This approach offers several advantages including computational efficiency, rapid response times, and increased scalability. We demonstrate its effectiveness on Wikipedia and Wikidata, including multimedia content through structured fact retrieval from Wikidata, opening up new pathways for multimodal question answering.

artificial intelligence, natural language, question answering, (16 more...)

arXiv.org Artificial Intelligence

2501.11301

Country:

Europe > France (0.14)
North America > United States > Illinois > Cook County > Chicago (0.05)
Europe > Ukraine > Kyiv Oblast > Chernobyl (0.05)
(6 more...)

Genre: Research Report (0.82)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Law (0.71)

Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)

Add feedback

MapQaTor: A System for Efficient Annotation of Map Query Datasets

Dihan, Mahir Labib, Ali, Mohammed Eunus, Parvez, Md Rizwan

arXiv.org Artificial IntelligenceDec-30-2024

Mapping and navigation services like Google Maps, Apple Maps, Openstreet Maps, are essential for accessing various location-based data, yet they often struggle to handle natural language geospatial queries. Recent advancements in Large Language Models (LLMs) show promise in question answering (QA), but creating reliable geospatial QA datasets from map services remains challenging. We introduce MapQaTor, a web application that streamlines the creation of reproducible, traceable map-based QA datasets. With its plug-and-play architecture, MapQaTor enables seamless integration with any maps API, allowing users to gather and visualize data from diverse sources with minimal setup. By caching API responses, the platform ensures consistent ground truth, enhancing the reliability of the data even as real-world information evolves. MapQaTor centralizes data retrieval, annotation, and visualization within a single platform, offering a unique opportunity to evaluate the current state of LLM-based geospatial reasoning while advancing their capabilities for improved geospatial understanding. Evaluation metrics show that, MapQaTor speeds up the annotation process by at least 30 times compared to manual methods, underscoring its potential for developing geospatial resources, such as complex map reasoning datasets. The website is live at: https://mapqator.github.io/ and a demo video is available at: https://youtu.be/7_aV9Wmhs6Q.

api, dataset, information, (15 more...)

arXiv.org Artificial Intelligence

2412.21015

Country:

Asia > South Korea (0.04)
Asia > Middle East > Qatar (0.04)
Asia > Bangladesh (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.76)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.49)

Add feedback

FactAlign: Long-form Factuality Alignment of Large Language Models

Huang, Chao-Wei, Chen, Yun-Nung

arXiv.org Artificial IntelligenceOct-2-2024

Large language models have demonstrated significant potential as the next-generation information access engines. However, their reliability is hindered by issues of hallucination and generating non-factual content. This is particularly problematic in long-form responses, where assessing and ensuring factual accuracy is complex. In this paper, we address this gap by proposing FactAlign, a novel alignment framework designed to enhance the factuality of LLMs' long-form responses while maintaining their helpfulness. We introduce fKTO, a fine-grained, sentence-level alignment algorithm that extends the Kahneman-Tversky Optimization (KTO) alignment method. Leveraging recent advances in automatic factuality evaluation, FactAlign utilizes fine-grained factuality assessments to guide the alignment process. Our experiments on open-domain prompts and information-seeking questions demonstrate that FactAlign significantly improves the factual accuracy of LLM responses while also improving their helpfulness. Further analyses identify that FactAlign is capable of training LLMs to provide more information without losing factual precision, thus improving the factual F1 score. Our source code, datasets, and trained models are publicly available at https://github.com/MiuLab/FactAlign

factuality, hague convention, language model, (10 more...)

arXiv.org Artificial Intelligence

2410.01691

Country:

Europe > Netherlands > South Holland > The Hague (0.06)
Asia > Singapore (0.04)
Europe > Bosnia and Herzegovina (0.04)
(7 more...)

Genre: Research Report (0.82)

Industry: Government > Military (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Answer When Needed, Forget When Not: Language Models Pretend to Forget via In-Context Knowledge Unlearning

Takashiro, Shota, Kojima, Takeshi, Gambardella, Andrew, Cao, Qi, Iwasawa, Yusuke, Matsuo, Yutaka

arXiv.org Artificial IntelligenceOct-1-2024

As large language models (LLMs) are applied across diverse domains, the ability to selectively unlearn specific information has become increasingly essential. For instance, LLMs are expected to provide confidential information to authorized internal users, such as employees or trusted partners, while withholding it from external users, including the general public and unauthorized entities. In response to this challenge, we propose a novel method termed ``in-context knowledge unlearning'', which enables the model to selectively forget information in test-time based on the context of the query. Our method fine-tunes pre-trained LLMs to enable prompt unlearning of target knowledge within the context, while preserving other knowledge. Experiments on the TOFU and AGE datasets using Llama2-7B/13B and Mistral-7B models show our method achieves up to 95% forgetting accuracy while retaining 80% of unrelated knowledge, significantly outperforming baselines in both in-domain and out-of-domain scenarios. Further investigation into the model's internal behavior revealed that while fine-tuned LLMs generate correct predictions in the middle layers and maintain them up to the final layer, they make the decision to forget at the last layer, i.e., ``LLMs pretend to forget''. Our findings offer valuable insights into enhancing the robustness of unlearning mechanisms in LLMs, setting a foundation for future research in the field.

in-context knowledge, information, knowledge, (14 more...)

arXiv.org Artificial Intelligence

2410.00382

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
North America > United States > Washington > King County > Seattle (0.04)
Europe > Monaco (0.04)
(2 more...)

Genre: Research Report > New Finding (0.66)

Industry:

Information Technology (0.46)
Law (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Enhancing Incremental Summarization with Structured Representations

Hwang, EunJeong, Zhou, Yichao, Wendt, James Bradley, Gunel, Beliz, Vo, Nguyen, Xie, Jing, Tata, Sandeep

arXiv.org Artificial IntelligenceJul-20-2024

Large language models (LLMs) often struggle with processing extensive input contexts, which can lead to redundant, inaccurate, or incoherent summaries. Recent methods have used unstructured memory to incrementally process these contexts, but they still suffer from information overload due to the volume of unstructured data handled. In our study, we introduce structured knowledge representations ($GU_{json}$), which significantly improve summarization performance by 40% and 14% across two public datasets. Most notably, we propose the Chain-of-Key strategy ($CoK_{json}$) that dynamically updates or augments these representations with new information, rather than recreating the structured memory for each new source. This method further enhances performance by 7% and 4% on the datasets.

dataset, information, paragraph, (15 more...)

arXiv.org Artificial Intelligence

2407.15021

Country:

Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
North America > Canada > British Columbia (0.04)

Genre:

Research Report (0.70)
Overview (0.47)

Industry: Consumer Products & Services (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback